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Abstract. The cops and robbers game has been extensively studied under the assumption of 
optimal play by both the cops and the robbers. In this paper we study the problem in which 
cops are chasing a drunk robber (that is, a robber who performs a random walk) on a graph. 
Our main goal is to characterize the "cost of drunkenness." Specifically, we study the ratio of 
' expected capture times for the optimal version and the drunk robber one. We also examine the 

04 . algorithmic side of the problem; that is, how to compute near-optimal search schedules for the 

f~| ' cops. Finally, we present a preliminary investigation of the invisible robber game and point out 

^ , differences between this game and graph search. 
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1. Introduction 



The game of Cops and Robbers, introduced independently by Nowakowski and Winkler 
and Quilliot [19] almost thirty years ago, is played on a fixed undirected, simple, and finite graph 
G. There are two players, a team of k cops, where > 1 is a fixed integer, and the robber. In 
^ I the first round of the game, the cops occupy any set of k vertices and then the robber chooses 
O, ' a vertex to start from; in the following rounds, first the cops and then the robber move from 
vertex to vertex, following the edges of G. More than one cop is allowed to occupy a vertex, 
and the players may remain on their current positions. At every step of the game, both players 
■ know the positions of all cops and the robber. The cops win if they capture the robber; that is, 
if at least one of cop eventually occupies the same vertex as the robber; the robber wins if he 
can avoid being captured indefinitely. The players are adversarial; that is, they play optimally 
against each other. Since placing a cop on each vertex guarantees that the cops win, we may 
O ' define the cop number, written c{G), to be the minimum number of cops needed to win on G. 
The cop number was introduced by Aigner and Fromme in [1]. 

In this paper we study a new version of the game, in which the robber is drunk; that is, he 
performs a random walk on G. The cops are assumed to follow a strategy which is optimal 
^ ■ with respect to the robber's random behavior. This version was proposed by D. Thilikos during 
. the 4th Workshop on GRAph Searching, Theory and Applications (GRASTA 2011) and he 
specifically asked the following question: "what is the cost of drunkenness^" In other words, 
how much faster than the adversarial robber is the drunk one captured? We try to answer various 
versions of this question. In addition, we study some algorithmic questions; for example, how 
to compute the expected capture time for an optimal strategy of cops. 

There is a large bibliography on pursuit games on graphs. The reader interested in cops 
and robbers can start by perusing the surveys [21 [71 |8] and the recent book [1]. To the best 
of our knowledge, the problem of a drunk robber has not been previously studied in the cops 
and robbers literature. However there is a strong connection to the Markov Decision Processes 
(MDP) literature; we will comment on this connection (and use it) in Sectional The reader can 
refer to [HI [ISl ED] for MDP surveys. 

While the emphasis of the current paper is on cops chasing the visible robber, we also touch 

briefly the case of invisible robber, both adversarial and drunk. Not much has been written 
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on this problem, but a related problem which has been extensively studied is the Graph Search 
problem, where a team of searchers try to locate in a graph an invisible fugitive, who is also 
assumed to be arbitrarily fast and omniscient (he always knows the searchers' locations as well 
as their strategy). A recent comprehensive review of graph search appears in [7j. We emphasize 
that the graph search problem is similar but not identical to cops chasing an invisible robber. 

The paper is structured as follows. In Section [2] we present definitions and our notation; the 
formulation is, naturally, probabilistic. In particular, we define the cost of drunkenness to be 
the ratio of the capture time for the adversarial robber and the expected capture time for the 
drunk robber. We also present a number of lemmas which we will repeatedly use in the following 
sections. In Section [3] we obtain bounds on the cost of drunkenness for various special families 
of graphs; for example, paths, cycles, grids, and complete d-aiy trees. In Section H] we look 
at the problem more generally and show that, for any c G [l,oo), there is a graph for which 
the cost of drunkenness is arbitrarily close to c. In Section O we connect the cops and drunk 
robber problem to Markov Decision Processes (MDP); that is, Markov chains with a control 
input which can modify the transition probabilities. MDP's provide a natural language for the 
problem; in particular they are useful in the computation of optimal cop strategies; that is, 
strategies which minimize the expected robber capture time. We then use the MDP machinery 
to present algorithms which compute the optimal cop strategy for a given graph and a drunk 
robber. In Section E] we give a brief, preliminary discussion of the cost of drunkenness for an 
invisible robber. Finally, in Section [7] we list possible future research directions. 

2. Preliminaries 

2.1. Definitions. Let G = {V,E) be a fixed undirected, simple, and finite graph. Since 
the game played on a disconnected graph can be analyzed by investigating each component 
separately, we assume that G is connected. We will use the following notation and assumptions. 

(i) There are k cops (for the time being we assume k > c{G) but this assumption will be 
relaxed in later sections). 

(ii) XI denotes the position of the i-th cop at time t {i G {1, 2, ... , k}, t G {0, 1,2,.. .}); 
Xt = {X^, . . . , X^) denotes the vector of all cop positions at time t; X = {Xq, Xi, X2, . . .) 
denotes the positions of all cops during the game (X may have finite or infinite length). 

(iii) Yt denotes the position of the robber at time t and Y = (Yq, Yi, ^2, • • •) the positions of 
the robber during the game. (Let us note that there is a correlation between X and Y; 
that is, players adjust their strategies observing moves of the opponent.) 

(iv) The moving sequence is as follows: first the cops choose initial positions Xq G V, then 
the robber chooses Yq G V. For t G {1,2, . . .} first the cops choose Xt and then the 
robber chooses Yf. Players use edges of the graph G to move from vertex to another one; 
that is, {Xi, Xi^^} e E for i e {l,2,...,k} and t G {0, 1,2,.. .}, and {Yt, Yt+i} G E for 
tG{0,l,2,...}. 

(v) The capture time is denoted by T and defined as follows 

T = min{t : 3i such that X^ = Yt}; 

that is, it is the first time a cop is located at the same vertex as the robber (note that 
this can happen either after the cops move or after the evader moves). Note that T < 00, 
since k > c{G) and c{G) cops can capture the adversarial robber (and so, of course, the 
drunk one too). 
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Assuming for the moment adversarial cops and robber, and given initial cop positions x E V'' 
and robber position y E V, we let ctcc,y{G, k) = T. The k-capture time is defined as follows: 

ct{G,k) — min maxctx y{G, k). 
xev'<= yev ' 

In other words, we allow our perfect players to choose their initial positions in order to achieve 
the best outcome. Finally, when k = c{G) we simply write ct(G) instead of ct(G, c(G)), and call 
it the capture time instead of c(G')-capture time. Let us stress one more time that the above 
quantities arc defined under the assumption of optimal play by both players. 

Next let us assume that the cops are adversarial but the robber is drunk. More specifically, 
we assume the robber performs a random walk on G. Given that he is at vertex v & V at time 
t, he moves to w e N{v) at time {t + 1) with probability equal to l/\N{v)\. Note that we do 
not include v in N{v); that is, we consider open, not closed, neighbourhoods. Moreover, the 
robber probability distribution does not depend on current position of cops; in particular, it can 
happen that the robber moves to a vertex occupied by a cop (something the adversarial robber 
would never do). 

Under the above assumptions, the drunk robber game is actually a one-player game and, for 
given initial configuration and cops strategy, the capture time T is a random variable. For any 
X eV'' and y eV, let 

dct^ {G, k) = K{T I Xq = x,Yq = y, k cops are used optimally) ; 

in other words, it is the expected capture time given initial cops and robber configurations x, y 
and optimal play by the k cops. 

Since the robber is drunk, we cannot expect him to choose the most suitable vertex to start 
with — instead, he chooses an initial vertex uniformly at random. Cops are, of course, aware of 
this and so they try to choose an initial configuration so that the expected length of the game 
is as small as possible. Hence, we define the expected fc-capture time as follows: 



a;gyfc \V\ 
yev I I 



As before, dct(G') = dct(G, c{G)). We define the cost of drunkenness as follows 

ct{G) 



F{G) 



dct(G') 



and we obviously have F{G) > 1. 

While we concentrate on the case k = c(G), it is also natural to consider expected capture 
time dct(G, k) for k ^ c{G). The next theorem shows that this is well defined for any A; > 1 (in 
particular, even for k < c{G)). 

Theorem 2.1. dct(G', k) < oo for any connected graph G and k > 1. 

Proof. Let G = {V, E) be any connected graph, D = D{G) be the diameter of G, and A = A(G') 
be the maximum degree of G. Fix any vertex v & V , place k cops on v, and let XI — v for 
all i and t (that is, cops never move; this is clearly a suboptimal strategy). For a given vertex 
y &V occupied by the drunk robber, the probability that he uses a shortest path from y to v 
to move straight to v is at least (1/A)^. This implies that, regardless of the current position of 
the robber at time t, the probability that he will be caught after at most D further rounds is at 
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least e = (l/A)^. Moreover, corresponding events for times t + iD, i E NU {0} are mutually 
independent. Thus, we get immediately that 



t>o 



t>o 





t 


(t > 




D_ 



D 



-FiT > iD) < 



i>0 



i>0 



D 

e 



DA^ < oo, 



and we are done. 



□ 



Let us remark that sharper bounds can be obtained for the capture time of a drunk robber, 
even in the case that the cops are also drunk; for example see [5] . However, Theorem 12.11 will 
be sufficient for our needs. 



2.2. Some Useful Lemmas. We will be using the following version of a well-known Chernoff 
bound many times so let us state it explicitly. 

Lemma 2.2 ([Hj). Let X be a random variable that can be expressed as a sum X = Y17=i -^i ^/ 
independent random indicator variables where Xi G Be(pj) with (possibly) different pi = P(Xj = 
1) = EXj. Then the following holds for t >0: 

P(X>EA- + «)<exp(-5^g^?^), 

P(A-<E.Y-«)<exp(-5|^), 

In particular, if e < 3/2, then 

P(|X -EX| > eEX) < 2exp 

Let us now consider the following (simple) random walk on Z. Understanding the behaviour 
of this Markov chain will be important in investigating simple families of graphs later. Let 
Xq = 0, and for a given t > 0, let 

^ _\Xt + l with probability 1/2 
\Xt — 1 otherwise. 

It is known that with high probability, random variable Xt stays relatively close to zero. We 
make this precise below using the Chernoff bound. 

Lemma 2.3. Let n G N and c G (2, oo). For a simple random walk {Xt) on Z with Xq = we 
have that \Xt\ < c^Jn\ogn for every t G {0, 1, . . . , n} with probability at least 1 — 2n^~'^ . 

Proof. Fix 72 G N and c G (2, oo). Let us perform n steps of a simple random walk on Z starting 
with Xq = 0. Let Yt {1 <t < n) denote the number of times the process goes 'up' until time t. 
It is clear that Ey^ = t/2 and 



e^EX 



Xt = Yt-{t-Yt) = 2{Yt-t/2). 
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For a given t, it follows from Chernoff bound (Lemma I2.2p that 

P (Xt < -c^nlogn) = T'(Yt<^- ^^nlogn^ 



exp 



{c\/n logri/2)^ 



2(t/2) 

< exp ^— — logra^ = n^^^/^ 



A symmetric argument can be used to get that Xt > c^/n\ogn with probability at most n 
Finally, from a union bound we get that the probability that there exists t {1 < t < n) with 
\Xt\ > Cy/ n log n is at most n ■ 2n~^ = 2n^^^ □ 

3. Bounds on the Cost of Drunkenness 

In this section we place upper and lower bounds on the cost of drunkenness F{G) when k 
cops are available. We emphasize the case k = c{G) but also consider values of 7^ c{G). We 
start with simple graphs (namely: paths, cycles, trees, and grids) in order to prepare for slightly 
more complicated families in the next section. 

3.1. Paths and a Suboptimal Strategy. In this subsection we play the game on P„, a path 
on n vertices (V(P„) = {0, 1, . . . , n - 1}, E(P„) = {{i - : i e {1,2, . . . ,n - 1}}). Clearly, 
c(P„) = 1; that is, one cop can catch the adversarial robber. Since the drunk robber is easier to 
catch than the adversarial one, let us study the drunk robber playing against a single cop. 

In this subsection we will compute the expected capture time using a suboptimal strategy, 
namely starting the cop at Xq = and moving him to the other end until he reaches n — 1 (or 
until capture takes place). It is clear that this strategy achieves capture; furthermore (as will 
become apparent in the following sections) many optimal strategies can be analyzed using this 
suboptimal one. 

Let Zf = Yf — Xt be the distance between players at time t. If the drunk robber starts at 
vertex k G {0, 1,. . . ,n — 1}, we have Zq = Yq = k. (In order to simplify the argument, we 
allow players to "pass each other" which is never the case in the real game; that is, Zt can be 
negative.) We can redefine the capture time as 

T„ = Tn{k) = mm{t :Zt<0}. 

Now, it is not so difficult to see the behaviour of the sequence {Zt)t>o- Note that at time t, the 
maximum distance between players is n — 1 — t which implies that the robber will be caught in 
at most n — 1 steps. We have the following Markov chain to investigate: for t G {0, 1, . . . , n — 2}, 
if Zt < n — 1 — t, then 



Zt 



+1 



2 with probability 1/2 (the robber goes toward the cop) 
with probability 1/2 (the robber goes away from the cop). 



If Zt = n — 1 — t (that is, the robber occupies the end of the path), then Zt+i = Zt — 2 
(deterministically) . 

Consider another Markov chain Z'^, which has the following simple behaviour: Z'q = k and 
for every t > 0, Z'^^^ = Z[ — 2 with probability 1/2; otherwise Z[^^ = Z[. Define T' = min{t : 
Zt 0}. In other words, we will be chasing the robber on the infinite ray R {V{R) = N U {0}, 
E{R) = {{i — : i E N}), which is slightly more difficult for the cop. Hence, it is easy 
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to prove that IE(T„ \ Zq = k) < E(T' | Z'^ = k). Moreover, it is also easy (using a recursive 
argument) to show that E(T' \ Z'^ = k) = k, and so E(T„(fc)) < k. Now we are ready to show 
the following. 

Theorem 3.1. Consider that the cop starts on one end of the path Pn and moves toward the 
other end. Let he the capture time, provided that the robber is drunk. Then, 

n f ^ ( log n\\ n — \ 



2 V V ^ 

Before we move to the proof of this theorem let us mention that, in fact, with a slightly more 
sophisticated argument, it is possible to show that ET„ = njl — 0{\\ 

Proof. Let n eN and fix any c > 2. The robber starts his walk on a vertex A; G {0, 1, . . . , n — 1}. 
Let us note that he is captured after at most n—1 steps of the process (deterministically); that 
is, Tn{k) < n — 1. As we already mentioned KTn{k) < k. Since the starting vertex for the 
robber is chosen uniformly at random, we get that ET^ < Y12=o ^/^ = {n — l)/2, so it remains 
to investigate a lower bound. 

Suppose first that k < (n — l) — Ci/n log n. It follows from Lemma 1231 that the robber reaches 
the other end of the path with probability at most 271^^^^ Z"^. If this is the case, we apply a trivial 
lower bound for Tn{k), namely, Tn{k) > 0; otherwise we get that the (conditional) expectation for 
Tn{k) is equal to k. Hence, ET„(fc) > k(l — 2n^~^ /''). Suppose now that k > (n — l) —Cy/n \ogn. 
Using Lemma [2.31 one more time, we get that with probability at least 1 — 2n^^^ /"^ the robber 
is not caught before time k — cy/n logn. 

Since the starting vertex for the robber is chosen uniformly at random, we get that 

^ n—l 

ET„ > - VeT„(A;) 
n ^-^ 

k=0 





1 


> 






n 


> 


( 







n— 1— cVnlogji n—1 

J2 k+ {k- cy/nk^) I (1 - 2n^-^'/^) 

k=0 fc=n— cVnlogn 



n — 1 



cHogn^ (1-2^1-"'/^). 



For a given n, the parameter c can be adjusted for the best outcome. To get an asymptotic 
behaviour, we can use, say, c = 3 to get that 

n f ^ /logn 



2 V V ^ 

and the proof is complete. □ 

The proof of the theorem actually gives us more. We get that with probability tending to 1 as 
77- — )■ oo, for all starting points for the robber (A; G {0, 1, ... , n—1}), the cop needs k+0{^/n\ogn) 
moves to catch the robber. 

3.2. Paths. We continue studying a visible robber on P„ but we now apply the optimal 
capture strategy (it is optimal for both adversarial and drunk robber). If n is odd, we start by 
placing a cop on vertex {n — l)/2; if n is even we have two optimal strategies, the cop can start 
on n/2 or n/2 — 1. In any case, after selecting an initial vertex the strategy is the same: the cop 
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dct(P„,) > ETl„/2J > - f 1-Of - 



keeps moving toward the robber. Except for initial placement, this is the strategy examined in 
the previous subsection and we have ct(P„) = Yn/2\. We easily get the following result. 

Theorem 3.2. 

In particular, dct(P„) = (1 + o(l))ri/4 and the cost of drunkenness is 

dct(P„) 

Proof. As we already mentioned, after the robber selects his initial vertex to start from, the 
game is played essentially on a path of length at most \n/2\ + 1 . From Theorem 13. we get 
immediately that 

dct(P„) < ETl,/2J+i < n/4. 
For a lower bound, we notice that the length of each subpath is at least \n/2\. By Theorem 13. 11 

\ogn' 
n 

and the proof is complete. □ 

In the general case when A; G N cops are available, we need to 'slice' a path into k shorter 
paths and place a cop on their centers. We get that dct(P„, k) = {1 + o{l))n/{4:k). 

3.3. Cycles. Let us play the game on a cycle C„ for n > 4 {V{Cn) = {1,2,..., n}, E{Cn) = 
{{i, i + 1} : i G {1, 2, . . . , n — 1}} U {{1, n}}). It is not difficult to see that c(C„) = 2; we use 
two cops to chase the robber. They start by occupying two vertices at the distance [(n+ 1)/2J, 
the maximum possible distance on the cycle. When the robber selects his vertex to start with, 
they move toward him and capture occurs at time ct(C„) = \ {n + 1)/4J. The same strategy is 
used when the robber is drunk. 

As for paths, one can introduce a random variable Zt to measure the distance between the 
robber and cops at time t. The problem (almost) reduces to the problem on a path. We mention 
briefly the difference below but the formal proof is omitted. If n is odd, then Zt has exactly the 
same behaviour as before. However, Zq = \_{n + 1)/2J with probability two times smaller than 
any other legal starting value (note that a uniform distribution on V{Cn) is used but there is 
just one vertex at the distance \_{n + 1)/2J). If n is even, then we get a uniform distribution for 
starting values but the transition from Zt to Z^+i is slightly different, namely, there is a chance 
for Zt to stay at the same value, provided that the robber occupies the vertex which is at the 
maximum distance from cops. In any case, it is straightforward to show that both upper and 
lower bounds still hold so we get the following. 

Theorem 3.3. 

n / ^ /logn\\ , ,^ , n + 1 
-(l-Oi^)] < dct(C„) < 



In particular, dct(C„) = (1 + o(l))n/8 and the cost of drunkenness is 

In the general case when e N cops are available, we spread them as evenly as possible. We 
get that dct(Cn, k) = (1 + o{l))n/ (Ak). 
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3.4. Trees. All families of graphs we discussed so far have a very nice property, namely, it is 
clear what the optimal strategy for the cops is. Once players fix their initial positions (that is, 
Xq and Yq), cops must move toward the robber in order to decrease the expected capture time. 
As we mentioned before, it is natural to measure the distance Zt between players at time t; Zt 
decreases by 2 if the robber makes a bad move or is occupying a leaf; otherwise the distance 
remains the same. This applies to the family of trees as well (note that c(T) = 1 for any tree 
T). However, this time it is not clear which vertex should be used for the cop to start with in 
order to optimize the expected capture time. For this family, the random variable Zt decreases 
with probability 1/ deg(f ), provided that the robber occupies vertex f , and the behaviour of the 
sequence {Zt)t>o highly depends not only on the degree distribution but on the structure of a 
tree as well. It is non-trivial to estimate the cost of drunkenness for a particular tree without 
performing extensive calculations for every vertex as a starting point (these calculations can be 
performed by computer, using the algorithms of Section [5.21) . However, some sub- families of 
trees are still relatively easy to deal with. 

Let us consider d regular, rooted tree T{d, k) of depth k. The root vertex on the level 
has d neighbours (children), vertices on levels 1 to — 1 have degree d + 1 (one parent and d 
children), leaves on the level k have degree 1 (just one parent). There are vertices on level 
i for a total of [d^^^ — l)/{d — 1) vertices. Due to the symmetry, the cop must start the game 
on the root. Since the drunk robber prefers to move toward leaves, it is natural to expect that 
his behaviour is similar to the one of the adversarial robber. Moreover, almost all vertices are 
located on levels k — o{k) so the robber almost always starts on these vertices which is clearly 
a good move. We show that the cost of drunkenness is as best as possible; that is, dct{T{d, k)) 
is tending to ct{T{d, k)) = k as k oo. 

Theorem 3.4. 

k-0{^/k\ogk) < dct{T{d,k)) < k. 
In particular, dct{T{d, k)) = {1 + o{l))k and the cost of drunkenness is 



Proof. Suppose that the drunk robber starts on level i > k — ^Jk \ogk. It follows from Lemma l2T2] 
that with probability 1 — 0{k~^) he will be caught on level k — 0{\/k log k). (In fact, it is also 
true for i > k/d, since the robber moves toward leaves with higher rate, namely, with probability 
{d— l)/d. However, an error following from this part is negligible comparing to the other error, 
so we stay with this obvious bound for i.) Therefore, 

dcmd,k))> (#+1 - i)/(d - 1) ~ o{,/kh^)){i - o{k~')) 



=k—\/k log k 



= (1 - Oid-'^^^)){k - 0{^/k\ogk)){l - 0{k-^)) 
= k-0{^k\ogk), 

which finishes the proof. □ 

3.5. Grids. The Cartesian product of two graphs G and if is a graph with vertex set V{G) x 
V{H) and with the vertices (mi, vi) and (m2, "^2) adjacent if either ui = U2 and f 1, f 2 are adjacent 
in if, or vi = f 2 and Ui, U2 are adjacent in G. We denote the Cartesian product of G and H by 
GDH. In this subsection, we will study a square grid PnDPn. 
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It is known that for any two trees Ti,T2, we have c{Ti\I\T2) = 2 [T5]. The capture time of 
the Cartesian product of trees was recently studied in [13]. It was shown that for any two trees 
Ti , T2 we have 

D(TiDT2) DiTi) + D{T2] 
ci{TiUT2) = -^-^ — - = ^ ' ^ — — 

where D = D{G) is the diameter of G. In particular, for a square grid we have that ct(P„nP„) = 
n — 1. 

We will show that the cost of drunkenness for a grid is asymptotic to 8/3. 
Theorem 3.5. 

dct(P„nP„) = (l + o(l))§n, 

o 

and the cost of drunkenness is P(P„nP„) = 8/3 + o(l). 

Proof. Suppose that the drunk robber occupies an internal vertex {u,v). The decision where to 
go from there can be made in the following way: toss a coin to decide whether modify the first 
coordinate (u) or the second one (f ); independently, another coin is tossed to decide whether we 
increase or decrease the value. Hence the robber will move with probability 1/4 to one of the four 
neighbors of {u, v). Note that, if we restrict ourselves to look at one dimension only (for example, 
let us call it North/South direction) we see the robber going North with probability 1/4, going 
South with the same probability and staying in place with probability 1/2. In other words the 
robber performs a lazy random walk on the path. Hence, both coordinates behave similarly to 
the lazy random walk on integers (move with probability 1/2; do nothing, otherwise). The same 
argument as in the previous proofs can be used to show that with probability, say, 1 — o(n~^), 
the robber stays within the distance 0{^/n\ogn) = o{n) from the initial vertex. Hence, if we 
look at the grid from the 'large distance' the drunk robber is not moving at all. 

Therefore, since we would like to investigate an asymptotic behaviour, the problem reduces 
to finding a set 5* consisting of two vertices such that the average distance to 5* is as small 
as possible. Cops should start on S to achieve the best outcome. It is clear that, due to the 
symmetry of P„nP„, there are two symmetric optimal configurations for set S: 

S = {{n/2 + 0(1), n/4 + 0(1)), (n/2 + 0(1), 3n/A + 0(1))}, 
S = {{n/A + 0(1), n/2 + 0(1)), (3n/4 + 0(1), n/2 + 0(1))}. 

In any case, the average distance is 

1 n— 1 



u=0 v=0 

The result follows. 



^^dist{{u,v),S) = {1 + o{l))8n / {x + y)dydx 

„_n ,,-n Jx=0 Jy= 



:i + o(l))-n. 



ly=0 



□ 



4. The cost of drunkenness 



In this section we show that the cost of drunkenness can be arbitrarily close to any real number 
c G [1, 00). In order to do it, we introduce two families of graphs, barbells and lollipops. 
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4.1. Barbell. Let n G N and c > 0. The barbell B{n,c) is a graph that is obtained from 
two complete graphs K^cn\ connected by a path P„ (that is, one end of the path belongs to 
the first clique whereas the other end belongs to the second one). The number of vertices of 
B{n,c) is (1 + 2c)n + 0(1), c{B{n,c)) = 1. In order to catch (either the adversarial or the 
drunk) robber, the cop should start at the center of the path and move toward the robber; 
ct{B{n, c)) = n/2 + 0(1). This family can be used to get any ratio from (1, 2]. 



Theorem 4.1. Let c > 0. Then, 

dct(i?(n,c)) = (l + o(l))|-i±i^, 
and the cost of drunkenness is 



Proof. The drunk robber starts on a clique with probability (2c)/(l + 2c) + o(l). If this is 
the case, the capture occurs at time n/2 + 0{\/n logn) with probability, say, 1 — o{n~^) by 
Lemma 12. 3[ If the robber chooses a vertex at the distance k from the robber to start with, he 
is captured after k + 0{^/n\ogn) steps, again with probability 1 — o{n~^). Hence the expected 
capture time is 

f 2c n 1 n\ 1 + 4c 



l + 2c 2 l + 2c 4/ ' ' ''2 2 + 4c 
The theorem holds. □ 

4.2. Lollipop. Let n G N and c > 0. The lollipop L{n, c) is a graph that is obtained from 
a complete graph i^Lcnj connected to a path P„ (that is, one end of the path belongs to the 
clique). The number of vertices of L{n, c) is (1 + c)n + 0(1), and the cop number c{L{n, c)) is 
1. In order to catch the perfect robber, the cop should start at the center of the path and move 
toward the robber; ct(L(n, c)) =72/2 + 0(1). However, it is not clear what the optimal strategy 
for the drunk robber is. The larger the clique is, the closer to the clique the cop should start 
the game. 

Theorem 4.2. Let c > 0. Then, 

(l + o(l))f ■ (-^-^+f_;f+^-^ force [0,1] 
1 + «(1))2(1T^' forol. 



dct{L{n, c)) = 
and the cost of drunkenness is 



FiLin c)) = = J + ^^1)' ' ^ 1] 

' ' ' dct(L(n,c)) \{l + c) + o{l), forol. 

Before we move to the proof of this result, let us mention that the cost of drunkenness (as 
a function of the parameter c) has an interesting behaviour. For c = it is 2 (we play on the 
path), but then it is decreasing to hit its minimum of 1 + \/2/2 for c = y/2 — 1. After that it is 
increasing back to 2 for c = 1, and goes to infinity together with c. Therefore, this family can 
be used to get any ratio at least 1 + \/2/2 ^ 1.71. 
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Proof. Let the cop start on vertex v at the distance (1 + o{l))bn from the chque (6 G [0, 1] will 
be chosen to obtain the minimum expected capture time). The drunk robber starts on a clique 
with probability c/ (1 + c) + o(l). If this is the case, the capture occurs at time bn + 0{^/n \ogn) 
with probability, say, 1 — o{n^^) by Lemma 12.31 If the robber chooses vertex at the distance 
k from the cop, then he is captured, again with probability 1 — o(n~^), after k + 0{^n logn) 
rounds. The robber starts between the cop and the clique with probability 6/(1 + c) + o(l) and 
on the other side with remaining probability. Hence the expected capture time is equal to 

, / c , h bn 1-b (l-b)n 

(l + o(l)) 6nH \ ^ — 

^ ^ \l + c 1 + c 21 + c 2 

71 

= (l + o(l))^(6^ + (c-l)6+l/2). 

The above expression is a function of b (that is, a function of the starting vertex v for the cop) 
and is minimized at 

, 1 - c ^ 
min < — - — , 



The theorem holds. □ 

It follows immediately from Theorems 13. 4[ 14. and 14.21 that the cost of drunkenness can be 
arbitrarily close to any constant c > 1. 

Corollary 4.3. For every real constant c > 1, there exists a sequence of graphs {Gn)n>i such 
that 

lim F{Gn) = hm -^^^^ = c. 

n-5-oo n-5-oo ClCt(^(jr„j 

5. Computational Aspects 

In this section we deal with computational aspects of the cop against drunk robber problem. 
Our analysis holds for any number of cops, that is, we no longer assume that k = c{G). 

5.1. Computing expected capture time for a given strategy. Suppose that we are 
given a graph and we fix a strategy before the game actually starts. We will now show how to 
explicitly compute the probabihty of capture at time t G {0, 1,2,.. .} as well as the expected 
capture time. 

Fixing a strategy in advance is the best one can do for the invisible robber case (see Section [6]) 
but for a visible one, cops should adjust their strategy based on the behaviour of the opponent; 
this will be treated in the next subsection 15.21 However, the approach presented here is less 
demanding computationally and can be used to provide an upper bound for the optimal expected 
capture time. 

Let G = (V, E) be a connected graph with V = {0,1, . . . ,n — 1}. Letting 
we have 



p^.^l \m\ forjGiV(^) 
otherwise. 



Note that P is the nxn transition probability matrix governing the robber's random walk in G 
in the absence of cops. To account for capture by the cops, define a new state space V = VU{n}, 
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that is, the old state space augmented by the capture state n. The corresponding (n+1) x (n+1) 
transition matrix is 

\^ 1 

In the absence of cops, the robber performs a standard random walk on G and never enters 
the capture state; if however he starts in the capture state, he remains there forever: Pn,n = 1- 
In other words, the Markov chain governed by P contains two noncommunicating equivalence 
classes: {0, 1, . . . , n — 1} and {n}. 

Suppose now that a single cop is located in vertex x. We will denote the corresponding 
transition probability matrix by P{x). Obviously, P (x) ^ P. The difference is caused by the 
possibility of capture, which can occur in two ways. 

(i) At the (i — l)-th round the robber is located at x and, in the first phase of the t-th round, 
the cop moves into x. Then the robber is captured, so P^ n (a;) = 1 and P^ y{x) — ^ for 

(ii) At the {t — l)-th round the robber is located y ^ x and, in the second phase of the 
i-th round, he moves from y to x. Hence the robber is captured with probability Py^x- 
So, for all y e y - {x}, Py^n {x) = Py,x-, Py,x {x) = 0. 

We can summarize the above by writing 



P{x) 



P{x) p(x) 
1 



where P {x) has O's in the x-th row and column and the corresponding probabilities have been 
moved into the p(x) vector. For example, letting G be the path with 5 nodes, the matrices P 
and P (2) are: 
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Especially for the placement round of the game {t = 0) we need a different matrix, because 
the robber does not perform a random-walk, but simply chooses an initial position uniformly 
at random; if he chooses the one already occupied by the cop, then he is captured immediately. 
Hence, for this round the appropriate transition matrix is P (x) , which is the unit matrix with 
the one of the x-th row moved to the (n + l)-th column. 

Let TTi{t) = ¥{Yt = i) iov i eV and te {0, 1, ... , s} and 7r(t) = (7ro(t), 7ri(t), . . . , 7r„(t)); also 
let TT (0) = ^, . . . , ^, O). Then, given a strategy X = (xq, xi, . . . , Xs), the above formulation 
yields 

TT (0) = 5? (0) P (Xo) 

and, for t G {1,2, . . .}, 

TT (t) = n {t - 1)P (Xt-l) . 

This implies that tt (t) =7? (0) P (xq) P (xi) P {X2) . . . P (xt). To illustrate this, let us continue 
the example. Suppose a single cop enters the path and follows the strategy X = (0, 1,2,3,4) 
(start on one end of the path and move to the other one) . Then we have 
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TT (0) = 5f (0) P (xo) = ( 1/5 1/5 1/5 1/5 1/5 ) 



n(l)=n{0)Pix,) = { i i i i i ) 



/ 








V 

/ 



7r(2) = 7r(l)P(a:2) = ( 
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7r(3) = 7r(2)P(x3) = ( ^ | ) 
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( 1 ) 



The elements 7r„(t) give the probabihties P{Xt = n) at time t, that is, the probabihties of 
capture in at most t steps. The probabihties of capture exactly at time t are then given by 
Unit) — 7r„(t — 1). The expected capture time (conditional on strategy X being used) is 



ET 



^t- (7r„(t) -7r„(t- 1)) 
t=i 



In the above example we have 



ET 



1- 



2- 



3- 



4 



31 
20' 



The approach can be generalized to more than one cop, by letting x = (xi,X2, . . . ,Xk) be a 
configuration of cops and defining P(x), P(x) analogously to the one cop case. Given that the 
cops follow the strategy X = (Xi, X2, . . . , X^), the transition probabilities of Y satisfy 

F{Yt = j\Yt_^=t) = P,^{Xt) 

for t < s. So the robber process is an inhomogeneous Markov chain, with the transitions 
controlled by the cops' actions. Markov chains of this type are called Markov Decision Processes 
(MDP) or Controlled Markov Processes, where the control function is Xf, it is a (stochastic) 
control in the sense that it allows us to change the transition probabilities of Yt. We can use 
the MDP formulation to compute ET for any given strategy X in reasonable time. Computing 
the optimal strategy is not computationally viable; for example, with |y| = n and k cops there 
may exist up to 0((n'^)*) strategies of length t (and the same number of corresponding ET's) to 
evaluate. In the Section \^72\ we will present a computationally viable approach to compute the 
strategy that is arbitrarily close to the optimal one. 

MDP's were introduced in the book pLOj; book-length treatments are [3l [T71 [HJ [20] ; an online 
tutorial is [12J. They have been applied to a version of the cops- robber problem in [6]. 

5.2. Computing near-optimal strategies and minimum expected capture time. Let 



us now present and algorithm to compute F{G) 



ct(G) 
dct(G) 



with arbitrarily good precision. Basi- 



cally this reduces to computing ct(G') and a good approximation of dct(G'), which can be done 
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independently. To this end we present two algorithms, both of which have previously appeared 
in the literature. To improve the presentation we assign a name to each algorithm and make a 
few notational modifications; also we point out the similarity between the two algorithms (which 
apparently has not been noticed before). 

(i) The CAAR (Cop ylgainst yldversarial i?obber) algorithm computes ct^.y(G) for every 
initial cop/robber configuration {x,y). In addition, CAAR computes the optimal cop 
and robber play for every {x,y). Capture time ct (G) is easily computed from ct(G') = 
min^ maxy ctx,y{G). 

(ii) Similarly, the CADR{Cop Against Drunk i?obber) algorithm computes (an arbitrar- 
ily good approximation of) dct^^yiG) and the (near-) optimal cop play for every {x,y); 

drunken capture time dct{G) is computed from dct(G) = min^; — 

CAAR was introduced by Hahn and MacGillivray in [9J. We present the algorithm for the 
case of a single cop (the generalization for more than one cops is straightforward). Slightly 
changing notation, we will use C^^y to denote the game duration when the cop is located at x, 
the robber at y and it is the cop's turn to move (in other words, C^^y equals ctx^y{G)). Similarly 
Rx^y denotes game duration when it is the robber's turn to move. For both Gx,y and Rx,y we 
assume optimal play by both cop and robber. Let us also define 

V"^ = Vx V-{{x,x) -.xeV}, 

(that is, V"^ excluding the diagonal) and for all x E V, let (x) = N (x) U {x} be the closed 
neighbourhood of x. CAAR consists of the following recursion (for 2 = 1,2,...): 

y{x,y)eV^ ■.R^:l= max G^J-p , (1) 

^'^ y'&N+{y) ""'y 

W{x,y) eV^ ■.Gi% = l+ min I^)^. (2) 

G and R are initialized with Gf}y = R^x}y = oo for all x ^ y. We take Gx}x = Rx]x = for 
i = 0,1,2,.... Then ([I])-([2]) is essentially equivalent to the version presented by Hahn and 
MacGillivray in [9], with just one difference which we will now discuss. 

In the matrix G is computed iteratively: the {i — l)-th matrix C''*"^-' is stored and used 

in the i-th iteration to compute C*^*-*. In numerical analysis this is known as a Jacobi iteration. 
It is well known that an alternative approach to computations of this type is the Gauss-Seidel 
iteration. In this iteration a single copy of G is stored and its elements are updated "in place." 
In [9j , Hahn and MacGillivray present the Jacobi version of CAAR and prove that the algorithm 
converges (in a finite number of steps) if and only if c(G) = 1. Hence CAAR computes the 
solution of the equations 

M {x,y) eV'^ : Rx,y = max C^y, (3) 

y'&N+(y) 

V (x, y) eV"^ : Gx,y = 1 + min Rx',y, (4) 

x'^N+{x) 

MxeV: Gx,x = Rx,x = 0. (5) 

The interpretation of the equations is the following. Equation captures the property that 
from configuration [x, y) the robber moves so as to maximize the length of the game; similarly, 
(jlj) describes the cop's goal to minimize the game duration (since the cop moves in the first 
phase of each round, 1 time unit must be added to min/?^/ ,^); finally says that the game 
ends when cop and robber occupy the same vertex. 
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Extending the CAAR idea to the drunk robber game, let us now use C^^y to denote dct^^yiG). 
In other words C^^y (respectively, Rx,y) is the expected game duration after the cop's (respec- 
tively, robber's) move. Recall (see Subsection 15. ip that Py^yi{x) is the probability of the robber 
transiting from y to y', given that the cop is at x; note that P{x) is a suhstochastic matrix. The 
analog of is 

V(x,2/)GV^^:i?« = Py^y (^) (6) 

y'eN{y) 

y{x,y)eV':Ci% = l+ min (7) 

and the analog of ([3])- ([5]) is 

y{x,y)eV^:R{x,y)= J] P,,,, (x) C,,,', (8) 

y'(^N(y) 

V (x, y)eV^ : C^^y = 1 + min R^,^y. (9) 

x'i^N+(x) 

\/xeV : C^,^ = i?^,^ = 0. (10) 

We want ([6])-([7j) to converge to the solution of fl8|)- f|T0|) . We will discuss convergence conditions 
(and initialization) presently. 

Actually ([6])-([7j) can be simplified. Since the drunk robber does not choose his moves, we can 
eliminate Rx]y from ([6])- ([7]) and obtain the CADR algorithm recursion: 

V (x, y)eV': C» = 1 + min [ Py,, (x') C^^r^^ ) . (11) 

\y'&N{y) J 

We have derived ffTTl) from ([6])- ([7]), which we see as an analog of ([I])- ([2]). However, we will now 
show that ffTTj) is a version of the value iteration algorithm, introduced and studied in the MDP 
literature [21 |T7l UHl |20]. Consider a general MDP process with state space S, action space A, 
transition matrix Q and cost matrix G{a) (that is, Gg^s' [o-) is the cost of transition s — )■ s' using 
action a). The state space satisfies = S't U Sa, where St are the transient states and Sa the 
absorbing ones; it is assumed that transitions after absorption have zero cost: Gg^s' (c^) = for 
s,s' G Sa- Let Cs be the expected total cost of the process starting from state s and continuing 
until absorption. Then [18] C satisfies the equations 

yseST:Cs = min [ G,,,, (a) + Q^^s' (a) C^' ) (12) 

and the solutions to (fT2|) can be obtained by the following value iteration: 

^seSr: = min ( G,.y (a) + V Q,,,, (a) Cfr'^ ) . (13) 

To show that ffT^ can be reduced to ffTT]) let us take St = V"^ and A = V; in other words, states 
s = {x,y) are cop/robber configurations and actions a = x' are new cop positions. Regarding 
move costs: (a) before capture every move has unit cost, (b) after capture only moves of the 
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form {x,x) — )■ {x,x) are possible and these have zero cost; in short 




_ J 1 if and only if x ^ y 
1 otherwise. 




Using the above, it is easy to reduce (|T3l) to (fTTl) . 

The convergence of the CADR algorithm has been studied by several authors, in various 
degrees of generality [HI [TOl |20]. A simple yet strong result, derived in [6], uses the concept of 
proper strategy: a strategy is called proper if it yields finite expected capture time. It is proved 
in |6j that: if a proper strategy exists for graph G, then the Gauss-Seidel version of CADR 
converges to the true C for arbitrary C*^°^ provided Cl^l > for all {x, y) & V^. As we have seen 
in Theorem 12. the cop has a proper strategy for every G. It can be proved that the Jacobi 
version of CADR also converges under the same conditions. 

Now, F{G) can be computed, easily. For every pair {x,y), one can obtain a desired approxi- 
mation of ct x^y{G) and dctxy{G) by performing CAAR and CADR, respectively. Then 



Both CAAR and CADR can be generalized for the case of k cops, replacing x by a fc-tuple 
X = {xi,X2, . . . , Xk)', however, execution time of both algorithms increases exponentially with k, 
hence the algorithms are computationally viable only for small fc's. Also CADR will work for any 
transition probability matrix P, not just for random walks. Hence, if desired, we can compute the 
cost of drunkenness for any number of cops (not just for k = c{G)) and for non- uniform random 
walks (i.e., discrete time birth-and-death processes) and other kinds of Markovian robbers. 

Both CAAR and CADR can easily provide an optimal and near-optimal cop strategy in 
feedback form U^^y, that is, the optimal cop move when the cop/robber configuration is {x,y). 
This is achieved by recording a minimizing x' in (jl]) / ffTTj) . The optimal robber strategy W^^y 
(for the adversarial robber) can be similarly obtained by CAAR. For every (x, y) configuration 
we can have more than one optimal moves, but they all yield the same (optimal) game duration. 

We have implemented the CAAR and CADR algorithms in the Matlab package CopsRobber, 
which can be downloaded from [13]. We have used this package to perform a number of numerical 
experiments, some of which are presented in the technical report [21]. This report also contains 
presentation of the algorithms in pseudo-code and a discussion of various computational issues. 



In this section we present an introductory discussion of the cops and robber game when the 
robber is invisible; in other words, the cops do not know the robber's location unless he is 
occupying the same vertex as one of the cops. All the other rules of the game remain the same. 
This version raises several interesting questions, a full study of which will be undertaken in a 
future paper. 

Since the cops never see the robber until capture, they cannot use feedback strategies. In other 
words, the cop strategy is determined before the game starts. This does not mean that every cop 
move is predetermined because in certain cases it makes sense for the cops to randomize their 



F{G) 



ct(G) 
dct (G) 



min^gv- Eyev ^^^^y ' 



min^jgy maxyev ct^y (G) 



6. The Invisible Robber 
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moves. Hence capture time will in general be a random variable, even in the case of adversarial 
robber (who may also benefit from a randomized strategy). 

Let us first examine the case of adversarial invisible robber. It is clear that, given enough 
cops, expected capture time will be finite. This is obviously true for \V\ cops, but in fact c{G) 
cops suffice, as seen by the following theorem. 

Theorem 6.1. Suppose that c{G) cops perform a random walk on a connected graph G, starting 
from any initial position. The robber, playing perfectly, is trying to avoid being captured. Let 
random variable T be the capture time. Then, 

ET < oo. 

Proof. Let G = {V,E) be any connected graph, and let A = A{G) be the maximum degree of 
G. Put k = c{G). For any configuration of cops x & V'' and any vertex occupied by the robber 
y E V, there exists a winning strategy S^^y that guarantees that the robber is caught after at 
most tx,y rounds. It is clear that cops will follow Sx,y with probability at least (1/A)'^*^'^. Now, 
let us define 

e= min (1/A)'=*^'" = (1/A)'=^» > 0, where Tq = max t^^y. 

This implies that, regardless of the current position of players at time t, the probability that 
the robber will be caught after at most Tq further rounds is at least e. Moreover, corresponding 
events for times t,t + To,t + 2To, . . . are mutually independent. Thus, we get immediately that 



ET = ^P(T>t) < ^p(t> ^ Tn 

= 5^ToP(T>2To) < ToJ2i^-ey = ^ < oo, (14) 

i>0 i>0 ^ 

and we are done. □ 





t 


(t > 







Hence c{G) is the minimum number of cops required to capture the adversarial invisible 
robber in finite expected time, since this task is at least as hard as capturing the adversarial 
visible robber. Of course, generally it will take longer, comparing to the visible robber case, to 
capture the invisible robber. Let us define ict x^y{G, k) to be the expected capture time when the 
initial cops/robber configuration is (x, y) and both the k cops and the robber play optimally; 
we also define 

ict(G', k) = min max ict^; y{G, k) 

and, finally, ict(G) = ict{G,c{G)). 

We now turn to the drunk invisible robber. He chooses his starting vertex uniformly at 
random and performs a random walk, as before. For a given starting position x E V'' for k 
cops, there is a strategy that yields the smallest expected capture time idcta;(G', k). Cops have 
to minimize this by selecting a good starting position: 

idct(G', k) = min idct2,.(G, k). 

As usual, idct(G) = idct(G, c(G)) but it makes sense to consider any value of A; > 1. The proof 
of the next theorem is exactly the same as Theorem 12.11 and so is omitted. 

Theorem 6.2. idct(G,k) < oo for any connected graph G and k >1. 
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Finally, the cost of drunkenness for the invisible robber game is Fi{G) = j^^^- It follows 
from last theorem that this graph parameter is well defined (that is, finite). 

Let us make a few remarks regarding the invisible robber with ^Hnfinite" speed (actually, 
what we mean by this is an arbitrarily high speed). Let us define the cop number for this case 
by c°^{G); it is the minimum number of cops that have a strategy to obtain a finite expected 
capture time. It is clear that c{G) < c°°{G) < s{G), where s{G) is the search number of G, that 
is, the minimum number of cops required to clean the graph in the Graph Search (GS) game 
(mentioned in Section [1]). We want to emphasize that the cops and robber game (with invisible, 
infinite speed robber) is different from the GS game and, in particular, there are graphs for 
which c°°{G) < s{G). For example, for the C3 cycle, s{Cs)=2 but c°°{G) = 1, namely one cop 
using a randomized strategy, can capture the invisible, adversary, infinite speed robber in T with 
ET = 2. Similarly, one cop on Ki ^, the star with 3 rays, can achieve ET = 11/3. Many other 
examples can be found. The main reason for the discrepancy between c°^(G) and s{G) is that, in 
the GS game, the fugitive is assumed omniscient and (under one interpretation) this means he 
knows in advance all the cop moves (until the end of the game). In the cops and robber family 
of games, on the other hand, omniscience is not assumed, either explicitly or implicitly. We 
can summarize in one phrase: clearing is harder than capturing even an infinite speed robber. 
We intend to further explore this issue, as well the computation of optimal strategies for cops 
chasing an invisible adversarial robber in a future publication. 

We will finish this section with the computation of the cost of drunkenness for two examples 
(path and cycle) involving an invisible (unit speed) robber. In both cases the computation is 
possible because the optimal strategy (for both the cops and the adversarial robber) is "obvious." 
Our examples are similar to the ones we have considered for the visible robber and proofs are 
omitted, since they are almost identical to those of Section [31 

Consider the path Pn again, with a single cop and an invisible robber. It is clear that the 
best strategy for the cop (regardless of whether he is playing against a perfect robber or a drunk 
one) is to start from one end of the path (say, from vertex 0) and move along the path until the 
robber is captured. We have ict(Pn) = n — 1. When cops are playing agains a drunk robber, 
the expected capture time is roughly two times smaller. 



n f ^ ( log n\\ , / ^ X n — 1 

'1-0^ < idct(P„) < ^— . 



Theorem 6.3. 

2 V" " V ^ 

In particular, idct(Pri) = (1 + o{\y)nj1 and the cost of drunkenness is 

Let us now play the game with two cops and an invisible robber on the cycle C„ for n > 4. 
It is not difficult to see that s(C„) = 2 = c{Gn)- The best cop strategy is to start on vertices 1 
and n; the cop occupying vertex 1 will move toward higher values, the other one will move in 
the opposite direction. The game ends after ict(Cn) = [(n — 1)/2J steps. When cops are playing 
against a drunk robber, the expected capture time is roughly two times smaller. 



Theorem 6.4. We have 

4 V~ " \ n 



"^_of!^^))<idct(c„)<^ 



SOME REMARKS ON COPS AND DRUNK ROBBERS 



19 



In particular, idct(Cn) = (1 + o(l))n/4 and the cost of drunkenness is 

7. Conclusion 

Most of the results in the paper pertain to the case of a visible (adversarial / drunk) robber, 
pursued hj k = c{G) cops. The cases of arbitrary k and invisible robber have been briefly 
touched. We conclude the current paper by listing additional questions regarding the cost of 
drunkennes. We begin by listing several questions related to the visible robber. 

(i) Our analysis can be expanded to strategies which use an arbitrary number of cops. As 
shown in Theorem 12.11 even a single cop can catch a drunk robber in finite expected 
time. Hence, for a given G we can study dct(G', k) as a function of k. Obviously this is 
a decreasing function; what more can be said about it? As a first step in this direction, 
the numerical approach of Section [5] can be used to explore the properties of dct(G', k) 
for a given graph G. 

(ii) Let us define dct(G, X) to be the expected capture time in graph G using strategy X. It 
is no longer assumed that X is an optimal strategy. Under what conditions on X and/or 
G will dct(G',X) be finite? Can we use the approach of Section O to obtain non-trivial 
bounds on dct(G, X)? 

(iii) A related question is whether (for a specific G and either optimal or general strategies) 
expected capture time can be connected to some graph parameter such as treewidth, 
pathwidth etc. 

(iv) How robust are our results to slight (natural) modifications of the cops/robber game 
rules? For example, would the cost of drunkenness change if we allowed the robber to 
loop into its current location (that is, to perform lazy random walk)? What about a 
"general" random walk (that is, with nonuniform transition probabilities). What about 
directed graphs? Finally, does the situation change significantly if the cops and the robber 
move simultaneously rather than the cops moving first? The algorithm of Section |5] can 
be easily modified to handle these cases and numerical experiments may be useful for an 
initial exploration. 

One can try to obtain similar results for the invisible robber. In Section |6] we showed how our 
approach can be extended (at least for certain families of graphs) to this case. In the examples 
we examined (paths, cycles) the optimal cop strategy is obvious. For general graphs, finding the 
search strategy optimal for the invisible (adversarial / drunk) robber will be more complicated. 
Is there a (computationally viable, perhaps approximate) algorithm to achieve this? 

Finally, let us note that all of the above analyses adopt the cops' point of view. It will 
be interesting to study the cost of drunkenness for the cops. In other worlds, assuming an 
adversarial evader and k drunk cops, can we place bounds on the increase of expected capture 
time as compared to the case of adversarial cops? Theorem 16.11 may be used as a starting point 
to achieve this goal. 
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